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^ ; ABSTRACT 

We describe the current status of CATS (astrophysical CATalogs Support system), a publicly ac- 
cessible tool maintained at Special Astrophysical Observatory of the Russian Academy of Sciences 
(SAO RAS) (http://cats.sao.ru) allowing one to search hundreds of catalogs of astronomical ob- 
jects discovered all along the electromagnetic spectrum. Our emphasis is mainly on catalogs of 
radio continuum sources observed from 10 MHz to 245 GHz, and secondly on catalogs of objects 
such as radio and active stars, X-ray binaries, planetary nebulae, HII regions, supernova rem- 
nants, pulsars, nearby and radio galaxies, AGN and quasars. CATS also includes the catalogs 
from the largest extragalactic surveys with non-radio waves. In 2008 CATS comprised a total of 
about 10 9 records from over 400 catalogs in the radio, IR, optical and X-ray windows, including 
most source catalogs deriving from observations with the Russian radio telescope RATAN-600. 
CATS offers several search tools through different ways of access, e.g. via web interface and 
e-mail. Since its creation in 1997 CATS has managed about 10 5 requests. Currently CATS is 
used by external users about 1500 times per day and since its opening to the public in 1997 has 
received about 4000 requests for its selection and matching tasks. 
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Q\ • 1 INTRODUCTION 

O 

Vast amounts of astrophysical information are now being published, based on observations of 
small and large sky regions. Typically, this information includes coordinates of the observed ob- 
jects and their physical characteristics in the form of source catalogs. In fact, almost every new 
large-scale observational experiment produces a new catalog of objects. Modern astrophysics 
operates with source parameters obtained in different wavelength ranges with the goal of ob- 
taining the most detailed understanding of physical properties and the processes of radiation of 
these objects. The ability of using different catalogs makes this problem considerably simpler. 

Over the past decades several different attempts have been made to combine large numbers of 
astronomical catalogs and make them accessible in a unified way, which can be classified grossly 
into two categories: databases of objects and catalog browsers. Examples of the former are NED 
(Helou et al., 1990; Mazzarella et al., 2002), Simbad (Egret, 1983; Wenger et al., 2000), and 
LEDA (Paturel et al., 1997). Examples of the latter are Vizier (Ochsenbein et al., 2000) and 
Datascope (http:/ /heasarc. gsfc.nasa.gov/vo/). 

Motivated by RATAN-600 observation requirements, problems of radio source study, and the 
underrepresentation of radio source catalogs in the then existing object databases, the present 
authors decided, in 1995, to create CATS, the astrophysical CATalogues Support system. Follow- 
ing the radioastronomical needs, the architecture of the operating system and the considerations 
expressed in the reviews by Andernach (1990, 1994, 1999), we chose to design CATS as a catalog 
browser rather than an object database, given that it would allow us to achieve a much better 
completeness in terms of the number of records, implying e.g. a more complete coverage of ra- 



dio source measurements over the entire radio frequency window. We thus deliberately delayed 
the known problem of cross-identification of all the catalogs, as provided in NED, SIMBAD, or 
LEDA, to a later stage. In that sense CATS is ideally suited for the experienced researcher who 
is looking for the largest amount of data available, but willing and able to work out the correct 
cross-identifications by himself. This effort will be compensated for by more complete data than 
those obtained from other existing object databases. 

The first steps of the creation of CATS were described by Verkhodanov and Trushkin (1994, 
1995a,b), Verkhodanov et al. (1997, 2000), and Trushkin et al. (2000). CATS allows one to 
operate with catalogs stored in ASCII and FITS binary tables, to fit radio continuum spectra, 
and to calculate spectral indices obtained from different radio catalogs. This service is located 
on the server 'cats.sao.ru' at SAO, Russia, and currently runs on a Dual Opteron 244 under OS 
Linux Fedora Core 6. 

2 IMPLEMENTATION OF CATS 

There are many modern databases operating with standard software that allow a user to obtain 
information on celestial objects, their images at different wavelengths, and the corresponding 
bibliographic data. The above-mentioned services like NED, SIMBAD, and LEDA are among 
these. Other services, which may be called "virtual telescopes" , offer extractions from large, ded- 
icated imaging surveys, such as the Sloan Digital Sky Survey (SDSS) at www.sdss.org, 2MASS 
at www.ipac.caltech.edu/2mass/, and SkyView at skyview.gsfc.nasa.gov make use of modern 
Web-technologies allowing one to select sky regions either via coordinates or object name re- 
solvers. We developed our own simple approach, following both the RATAN-600 observational 
tasks for the preparation of radio source spectra, as well as users' desire to be able to select 
observing targets from available source catalogs. Based on simple system commands and basic 
functions in C, our approach permitted us to achieve the fast speed of source selection and easy 
portability among different Unix-like systems. Moreover, this approach was better suited to our 
very limited resources as it allowed easier integration of new catalogs. We believe that our best 
contribution is a larger number of searchable catalogs rather than a more sophisticated search 
interface with less accessible data. We also wish to emphasize that we are not trying to simply 
duplicate other catalog browsers by using their sets of catalogs. A major fraction of these has 
been obtained or prepared by members of our team (see e.g. Andernach 2009), and over 100 
radio source lists available from CATS are not available from VizieR. 

The present data collection consists of catalogs, their descriptions, and corresponding programs 
operating under OS Linux. The program codes are created in the C language and translated 
with the GNU project C compiler. The codes are freely shared, provided they are used for 
non-commercial goals. 

New catalogs may be added to the system in conformity with the following rules: 

1) Every new catalog of objects should be placed in the UNIX directory with the same name as 
the catalog of objects; 

2) The description of the catalog should also be placed in that directory; 

3) The programs (or operating scripts) for local operations with the catalog of objects are also 
placed in the same directory; and 

4) Brief characteristics, program names and description file of the astrophysical catalog are 
stored in a file named 'cats_descr'. 

The following information is stored in the description file 'cats_descr': 

• the name of the catalog, which coincides with the name of the UNIX directory, 

• the type of the catalog (radio, optical, combined, etc.), 

• frequency /wavelength range, 



• minimum fluxes or magnitudes, 

• equatorial or /and galactic coordinate ranges of the repective catalog, 

• names of the local programs for the 'select' and 'match' functions (see Section 3.2), 

• name of the document file, 

• number of records in the catalog, 

• the size of the beam pattern or angular resolution, 

• a recalibration factor (if available) to put the flux densities on a commonly agreed flux scale, 
and 

• a reference. 

Parameters from the description file are used by the programs that process the input data, e.g., 
user-specified limits in coordinates and/or flux density inform the operating programs about the 
relevant sky zones to be searched and thus economize on processing time if a given catalog is 
out of the required range. 

The lower level of the CATS control system includes several basic utilities: 

(1) 'c_sel' selects objects with parameters within the user-specified ranges; 

(2) 'c_match' looks for objects falling within a certain distance from a user-specified coordinate 
(a "cone search"); 

(3) 'cats_divide' operates with catalogs sorted by right ascension and produces an index of record 
numbers corresponding to a certain right ascension; 

(4) 'epoch' converts the user-specified coordinates if they are provided for non-standard equinox, 
i.e. neither 1950.0 nor 2000.0 (proper motions are assumed to be zero); 

(5) catalog coordinates stored according to the equinox as published, and user-specified coordi- 
nates are converted to that equinox; 

(6) 'cats_base' controls the file of descriptions 'cats_descr' selecting characteristics of catalogs 
and produces input parameters for the programs 'c_seP and 'c_match.' 

For catalogs in FITS binary table format (e.g., NVSS or SDSS catalogs), special programs for 
the 'select' and 'match' functions were prepared. 

To organize access of a local user of CATS from any directory of the OS, the operation programs 
that process the main CATS tasks are placed in a commonly accessible directory of the OS 
Linux. The control procedures 'cats_sel' and 'cats_match,' organized as shell-scripts, cover all 
the low-level programs and provide the interface between CATS and a local user, or requests 
made via HTTP or e-mail. The described method of the catalog storage facilitates the database 
development, its expansion with new data, and the possibility to tune the supporting programs. 

CATS has its own indexing procedure 'cats_divide' for object coordinates in a catalog. The 
program decides where to start searching records of the input catalog from an index prepared 
from a right- ascension sorted list, allowing one to make efficient use of hard-disk seeking func- 
tions. To avoid problems of searching within a few degrees of the poles where recalculation from 
one equinox to another one is done, we process the total list of objects in these zones to find a 
required source. The indexing of CATS lists by declination is now under consideration. 

3 THE MAIN FUNCTIONS 

The following functions are currently implemented in CATS: 

(1) Selection of objects from one or several catalogs by the following criteria: equatorial or 
galactic coordinates, flux densities, spectral indices, frequencies, names, and (for some catalogs) 
the type of objects. Parameters common to all catalogs (coordinates and flux density) are used 
in the selection. 

(2) Search for counterparts of objects (selected from one or several catalogs) by coordinate 



(3) Cross-identification of different catalogs. This procedure is currently available only for local 
users. It uses the output from the selected zones of one catalog as input for the matching 
procedure on another catalog. 

(4) Preparation of a file with a short description and characteristics of each catalog, printing of 
the total list of catalogs for the required sky areas (for local user). 

(5) Plotting radio spectra of selected sources. This can be done from multi-frequency catalogs, 
or catalogs prepared by the CATS team from cross-identification of radio catalogs at different 
frequencies, or even by individual user input of frequencies and flux densities for a source (at 
cats . sao .ru/~satr/SP/spectrum.html). This function is realized in two procedures. The first 
is a Java script for data in the homogenized CATS format for single objects, accessible from the 
web page. The second one is suitable for object lists in the local RATAN-600 data processing 
system. 

The result of the object selection is a datafile sorted according to different object characteristics, 
such as right ascension (default mode) , declination or frequency. This file can either be displayed 
on the standard output devices or sent to the user in the following formats: 

(1) The 'native' (original) format of the catalog (i.e. columns as published). 

(2) Standard ("homogenized") output format. This format is organized to be common for all the 
catalogs and used later for unification of data, preparation of spectra and statistical analysis. 
The standard FITS Table format describing data and fields of the table may be added as a 
header of the resulting file. 

3.1 Access to CATS 

Different modes of access are provided according to user requirements and CATS goals, following 
modern trends of software development. Thus, three main modes of on-line access to the CATS 
service have been prepared: 

(1) Dialog mode (non-graphics) has been maintained until the present. Several scripts written 
in the UNIX 'shell' language have been created (Verkhodanov and Trushkin 1994). They permit 
one to operate with CATS' supporting programs via TCP/IP and NFS protocols in the local 
computer network. 

(2) Hypertext access (http://cats.sao.ru) is provided to allow a user from the Internet system 
to operate with CATS via a hypertext transfer protocol. It allows one to execute all described 
operations from the Web-page. 

(3) FTP access (ftp:/ /cats. sao.ru) allows a user to obtain both the description of CATS catalogs 
and the catalogs themselves. Here is an example of access by anonymous FTP to the catalog 
WISH: 

anonymousOf tp : / / cats . sao . ru 

ftp> bin 

ftp> cd pub/cats/ 

ftp> cd WISH 

ftp> get wishll.cat 

ftp> bye 

All the catalog names are stored in a file of descriptions 'cats_descr' (ftp:/ /cats. sao.ru/cats_descr). 

(4) E-mail access allows the user to send batch requests to CATS. 

One can send an e-mail with requests of matching with lists or selection by several parameters. 
The e-mail will be read automatically and sent for execution to the CATS scripts. The result 
will be sent back automatically to the user via e-mail. 

3.2 'Select' and 'match' procedures 

Two main procedures of data selection have been provided at different levels of CATS control: 
the 'select' and 'match' tasks. They follow the first three main functions described above and 
are realized in different access modes. As was described earlier, the two low-level nrocednres 



'c_sel' and c_match' provide information according to the corresponding user requests and pass 
it to the upper-level control scripts 'cats_seP and 'cats_match'. Using these two procedures, the 
cross- identification of catalogs is possible by using output from the 'select' function of the first 
complete catalog as the input of the 'match' function of the second catalog or catalogs. 



3.3 



E-mail access 



In order to economize user's real time and avoid delays in on-line data exchange, we provide 
the possibility to submit a batch task in the form of an e-mail letter. The e-mail requests may 
have several formats (as explained in a file that is returned on an empty e-mail to cats@sao.ru, 
with no subject). Two examples of the body of an e-mail (no subject required) are shown below. 



(1) selection task: 

mail -s "" cats@sao.ru 
cats select 

ra min=10:30 max=10 : 40 : 00 . 
dec > 10' < 12' 30" 
flux > . 5mJy 
catalogs r epoch=1950 
cats end 



(2) matching task: 

mail -s "" cats@sao.ru 
cats match 
catalogs NVSS, FIRST 
window box x=30" y=10' 
sources : 

si 02:02:00 +31:23:16 1950 
s2 02:23:10 34:03:00 1950 
s3 21:26:33.9 -18:34:33.0 1950 
cats end 



These examples demonstrate the use of some keywords for batch requests. The beginning and 
end of the requests are defined by the statements: 'cats start' and 'cats end'. The first ex- 
ample shows how to search within certain limits of coordinates by using the 'min', 'max' or 
'<', '>' operators for right ascension 'ra' and declination 'dec'. The keywords expression 'cat- 
alogs r' sets the type of catalogs to be used for selection, where 'r' means all the radio cat- 
alogs. Instead of 'r', one may choose 'o' for the optical catalogs, 'x' for the X-ray ones, 'ir' 
for the infrared ones, or just the names of used lists separated with comma, e.g. 'catalogs 
NVSS, FIRST, WISH' (the full list of the CATS catalogs can be obtained from the CATS web 
page http://cats.sao.ru/doc/CATS_list.html). The keyword 'epoch' sets the equinox of the input 
coordinates. 'Flux' sets flux limits. 

The second example of the e-mail task is the matching procedure. There are some additional 
keywords 'window box x=30" y=10', where 'box' is the type of matching window (others are 
'circle' and 'ellipse'), 'x' and 'y' are along longitude and latitude of the corresponding coordinate 
system, respectively. Note that 'x' and 'y' are half the side lengths of the search 'box,' the value 
following the 'circle' keyword is its radius, and those following the 'ellipse' keyword are the 
semi-major and semi-minor axes of the ellipse. 



4 THE MAIN CATALOGS 

The major source of the radio catalogs in CATS is the collection one of us (Andernach, 1990, 
1999), who has spent considerable effort to recover older source lists not previously available in 
electronic form, using a scanner and optical character recognition software (Andernach, 2009). 
This collection is complemented in several ways: contributions from authors, astro-ph preprints, 
tables from electronic journals and the CDS catalog archive, as well as occasional manual re- 
typing of the original catalog. About 70 catalogs have been typed and/or corrected manually 
at SAO RAS by S.Trushkin. The largest catalogs (e.g. NVSS, FIRST, SDSS, etc.) were copied 
from web sites of the catalog authors. 



CATS is mainly a radio-astronomical catalog service. All major catalogs incorporated into CATS 



Table 1. Major radio astronomical catalogs available in CATS 
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Table 2. Some catalogs of other wavelength ranges in CATS 
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CATS also contains observational and combined catalogs of various Galactic and extragalac- 
tic objects unified in the combined tables described in Verkhodanov et al. (2005) where the 
references to all the mentioned catalogs can be found. 

5 SUMMARY 

CATS provides a simple and convenient access to astrophysical data and complements the 
data available from other services, most notably for radio continuum flux densities, for which 
it provides the largest amount of data in any such service. Operation with CATS permits 
astronomers to search for peculiar objects and study physical processes in sources of cosmic 
radiation. 

Until October 2004, we registered over 28000 requests for the 'select' or 'match' procedures, 
which are the most popular. The most popular catalogs for FTP-copying over the last five years 
are QSO by Hewitt and Burbidge (20 times), PGC (19 times), and NVSS (18 times). CATS 
processes daily up to 1000 HTTP-requests for information concerning the catalog descriptions. 

CATS is being expanded continuously and presently comprises more than 400 catalogs including 
all the RATAN-600 catalogs. In its present form CATS occupies about 60Gb of disk space, which 
makes its installation possible also on notebooks. One of the possible future developments of 
the system is the preparation of the VOTable format for output tables. At present, limited 
resources preclude linkage with the Virtual Observatory (VO). 
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